Towards a Framework for Observing Artificial Evolutionary 

Systems 

Janardan Misra 
HTS Research, Bangalore, India 560076 
Email: janardan.misra@honeywell.com 

Contents 



Q\ 1 Background [2j 

1.1 Motivations [2] 

1.2 Contributions [3j 

Ch 2 The Framework [3] 

I ^ 2.1 The Formal Structure of the Framework [3] 

2.1.1 Observation Process and the Model Universe IU 

2.1.2 Entities and Their Characteristics 131 

2.1.3 Distance Measures [S] 

2.1.4 Observable Limits on Mutational Changes [5] 

2.2 Evolutionary Components [7] 

£h 2.2.1 Mutations 

2.2.2 Reproduction [7] 

2.2.3 Heredity HH 

2.2.4 Natural Selection [12] 

> 3 Case Studies [13] 

3.1 General Considerations [32] 

3.2 Case Study 1: Langton Loops Q~?] 

3.3 Case Study 2: Algorithmic Chemistry [TH] 

1 4 Related Work [20] 

o 

5 Conclusion [21] 

5.1 General Remarks [2T] 

5.2 Design Suggestions for ALifc Researchers [22] 

k> 5.3 Limitations [22] 

5.4 Further work [23] 

Abstract 



Establishing the emergence of evolutionary behavior as a defining characteristic of 'life' is a 
major step in the Artificial life (ALife) studies. We present here an abstract formal framework 
for this aim based upon the notion of high-level observations made on the ALife model at hand 
during its simulations. An observation process is defined as a computable transformation from the 
underlying dynamic structure of the model universe to a tuple consisting of abstract components 
needed to establish the evolutionary processes in the model. Starting with defining entities and their 
evolutionary relationships observed during the simulations of the model, the framework prescribes 
a series of definitions, followed by the axioms (conditions) that must be met in order to establish 
the level of evolutionary behavior in the model. The examples of Cellular Automata based Langton 
Loops and A calculus based Algorithmic Chemistry are used to illustrate the framework. Generic 
design suggestions for the ALife research are also drawn based upon the framework design and case 
study analysis. 

Keywords: Artificial Life, Evolution, Observations, Formal Framework, Evolutionary Processes. 
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1 Background 



The phenomenon of "life" on earth is one of the most intriguing one with vast variety and complexity of 
forms in which it is found on multiple levels ranging from microbiological scale to higher taxa exhibiting 
a wide array of characteristics. Although we can explain several aspects of life around us in the light 
of existing theories for real-life evolution, we do not yet have a comprehensive understanding of the 
principles underlying the emergence of life and the conditions that led to the diversity and complexity of 
life on earth |Fut98j . Experimental methods to understand biological processes are usually difficult and 
error prone because living systems are by nature complex in design and usually hard to manipulate. 
Evolution is even more difficult to study experimentally since experiments may span over several 
generations and are usually difficult to control. 

Artificial life (ALife) is an elegant methodology to complement real life theories to study the prin- 
ciples underlying the complex phenomena of life without directly working with the real-life organisms. 
For example, ALife studies can complement theoretical biology by uncovering detailed dynamics of 
evolution where real life experiments are not possible, and by developing generalized formal models for 
life to determine criterions so that life in any arbitrary model can be observed. 

Cellular Automata based models are one of the earliest attempts of synthesis to understand the 
underlying logic of self reproduction [Sip98 . Later attempts in the field have considered several new 
kinds of synthetic structures including programs, A terms, strings, graphs, automata's etc (see for 
overview [DZBOlJ and have demonstrated that one or the other observable properties of real-life are 
shared by all of these models, though the parallel diversity and robust evolving structures which we find 
in real-life are yet to be designed. One of the guiding principles of ALife research behind these novel 
class of synthetic structures is that - "life is a property of form and organization rather than the matter 
used to build it" [Lan95 . This criterion to identify life in these novel synthetic structures in turn poses 
further questions as to which kind of organizational structures possess life? Which properties should 
we be looking at in those structures? and most importantly how can we recognize life in any arbitrary 
model? 

To partly address these questions, in this paper, we proceed with the hypothesis that one of the 
possible ways life can be recognized in an arbitrary ALife model during its simulations is by observing 
population of entities undergoing evolution in the sprit of evolution by natural selection, which demands 
the presence of reproduction, heredity, variation owing to mutational changes, and finally natural se- 
lection based reproductive success. (See also l 'Daw82j). Though the criterion to equate life with the 
presence of evolutionary processes excludes other plausible properties including metabolism |BFF92j , 
complexity | AO COO] , self organization [Kau93] , autonomy and autopoicsis |Zcl8l], yet captures a wide 
class of interesting phenomena related to population level evolution of entities [SHOP] . Such a identi- 
fication of population level evolutionary phenomena in arbitrary ALife models critically depends upon 
the observations carried out over simulations as we discuss next. 

1.1 Motivations 

Observations play a fundamental role both in real life studies as well as in ALife research. In case of 
real-life studies the role of observations is usually limited to an experimental analysis to uncover the 
specific dynamics underlying the observed life forms and their properties using natural observations 
or controlled experiments. On the other hand, in case of ALife studies, in general there is no known 
method to decide beforehand the kind of entities, which might demonstrate non-trivial life-like behavior, 
without closely observing the simulations of the model. 

The very identification of life is thus an existential problem for ALife studies and we need some 
sound formal framework to address this problem. In absence of a formal framework, we often encounter 
intuitive and informal arguments, which remain useful only to specific models and do not always have 
the generic perspective. We question whether these model-specific arguments are sufficient to support 
the presence of an extremely complex phenomenon such as evolution in ALife models. Without formal 
foundations to ascertain these (informally presented) claims, there is always a danger to run into 
conflicting arguments, which might, for example, be based upon observations of the smulations on 
different levels. Nehaniv and Dautenhahn [ND98 specifically discuss that identification of time varying 
entities is a deep rooted problem in the context of formal definitions for self-reproduction and add that 
in absence of observers it is problematic to decide whether an instance of artificial self-replication be 
treated at all a life-like one. 
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In an attempt to provide a formal platform for observations in ALife studies, a high level abstrac- 
tion mechanism is presented for characterizing the observations needed to establish the evolutionary 
behavior in ALife studies. Initial concepts in this direction appeared in [Mis06a HM07]. The central 
concept of the framework is the formalization of the observation process, which we believe is essential, 
but most often remains implicit in ALife studies. The observation process leads to abstractions on the 
model universe, which are consequently used for establishing the necessary elements and the level of 
evolution in that model. Examples of Cellular Automata based Langton Loops (Section 3.2 1 and A 
calculus based Algorithmic Chemistry (Section 3.3) are used to demonstrate the applicability of the 
formalism. Importantly the framework does not build upon the low-level dynamics or the "physical 
laws" of the underlying universe of the particular ALife model at hand, and thus permits the study of 
higher-level observationally "emergent" phenomena as a basis of evolution. 



1.2 Contributions 

The paper brings the implicitly assumed notion of observations to be carried out independent of the 
underlying structure of the model into main focus of ALife studies. It was not clear before that 
observational processes can be independently studied in their own right and the work presented in 
this paper makes it clear by placing observations into distinct formal platform. The work can also be 
seen as an attempt to fulfill the need for explicitly separating the design of the ALife models from the 
abstractions used to describe their dynamic progression. 

The approach has helped us to formalize certain aspects of life including recognition of reproductive 
relationships under parental mutations as well as reproductive mutations in children along with their 
epigenetic developments, which were believed to be difficult to formalize before |ND98I INeh05j . The 
formalism captures wide range of reproductive instances including the case of multi parent reproduction 
(without resorting to the concept of species) , and the case of reproduction without overall growth of 
the population (cf. ND98 ). Finally framework design and analysis of the case studies are used to draw 
useful design suggestions for the ALife research so that interesting evolutionary phenomena involving 
life-like entities can be better synthesized and analyzed. 



The paper is organized as follows: In Section [2j we will formally elaborate the framework. Case 
studies will follow in Section[3]- Section [3~2| applies the framework on cellular automata based Langton's 
Loops and Section 3J5 on A calculus based Algorithmic chemistry. Section [4] presents a discussion of 
related work, and is followed by concluding remarks in Section [5] along with the discussion on design 
suggestions for the ALife researchers in Section |5.2| Limitations of the framework and pointers for 
further work are discussed in Sections |5.3| and |5.4| respectively. 



2 The Framework 

In the ensuing discussion, we will use "ALife model" and "model" , "Observation process" and "Ob- 
server" interchangeably to add convenience in presentation. Similarly "real-life" is used in the paper 
to refer to organic life on earth in contrast with the "artificial- life" . Also, Observer Abstractions will 
refer to specific observations and corresponding abstractions made upon the ALife model during its 
simulations. Axioms are used to specify conditions which need to be satisfied in order to infer various 
components of evolution. Thus for each fundamental component of evolution: self reproduction, muta- 
tion, heredity, and natural selection, framework specifies certain Axioms constraining what is needed 
to be observed and consequently inferred in a formal way if any claim towards presence of any of these 
evolutionary components has to be substantiated. The aim is to define these formal Axioms such that 
only valid claims for evolutionary processes in a model can be entertained. Auxiliary formal structures 
are used in the intermediate stages of analysis. E.g., distance measure for determining dissimilarity 



between entities for their specific characteristics (see Section 2.1.31 



2.1 The Formal Structure of the Framework 

To illustrate the framework, we will use a simple example of a binary string based chemistry whenever 
required in the discussion to assist the intuition behind the formalism. The chemistry will be referred 
as CBS (Chemistry of Binary Strings). Specifics of the design and structure of the chemistry will be 
explained as we proceed. 
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2.1.1 Observation Process and the Model Universe 

We define the observation process as a transformation from the underlying universe of the ALife model 
to a set of observed abstractions as follows: 



Observation Process. T ^obj II: An observation process Obj is defined as a computable trans- 
formation from the underlying model structure T = (E,T) to observer abstractions II = (E, F, T, D, 
Smut, S rep rnu t, C) and represented as T *-^obj II. T, and II are defined below. 

The condition of computability is to ensure that the framework is decidable (or feasible Bcd99J), 
that is, the observation process only involves feasible computable steps, which can also be algorithmi- 
cally programmed by the designer of the model and that infeasible observations defined in terms of non 
verifiable claims (e.g., 'meta - information' based claims) can be avoided. 



States. E: set of observed states of the model in a simulation. 



The exact definition of a "state" would vary from one model to the other due to their irreducible 
design differences as well as the level at which observations are being made. A multiset []]can sometimes 
be used to represent state of a model by defining it as a collection of observable basic structures and 
their corresponding multiplicities in the model at any instance during its simulation. As an example, 
we can consider an observed state in the case of our example chemistry of binary strings, CBS, as a 
multiset - such that some specific state could be - 

{(00101, 2), (10101, 1), (010, 1), (0100, 1)(10100, 1)} 

Further illustrative examples can be seen in the case studies appearing in Section [3] 

Observed Run. T: set of observed sequences of states, ordered with respect to the temporal pro- 
gression of the model. Each such sequence represents one observed run of the model. A sequence of 
states is formally represented as a mapping: N — > E, where N is the set of non negative integers acting 
as a set of indexes for the states in the sequence. 

A temporally ordered state sequence is one of the basic building blocks in the framework upon 
which all other observed abstractions are made. Such a definition of a run of model implicitly implies 
that the framework is fundamentally based upon the dynamic simulations of the model and not upon 
static analytical inferences. This is in accord with the notion of "weak emergence" [BM P+OOj . which 
is a generic characteristic of most of the ALife studies. 

For a state S, 5—1, and 5+1 would denote the the states just before and after 5 in a state sequence. 

E and T thus define the underlying dynamic structure of the model T = (E,T). Using E and T, 
sometimes, a state machine model can also be used to define T. 



2.1.2 Entities and Their Characteristics 

Observer Abstraction 1 (Entity Set). E: set of entities observed and uniquely identified by the 
observer within a state and across the states of the model. 

The criterion to select the set of uniquely identifiable entities in a given state of the ALife model 
is entirely dependent on the observation process as specified by the ALife researcher. Thus for the 
same set of simulations of a model, there may exist very different observed states as well as entities. 
Nonetheless, same observation process must not yield different sets of entities in two identical states. 

Defining sound criterion to identify entities often requires a careful a attention since arbitrariness 



in defining entities might well lead to the problem of false positives as discussed later (see Section 5.3 ) 
"Tagging" can be sometimes used as a mechanism for the identification of individual entities when- 
ever there exist multiple entities in the same state which are otherwise indistinguishable. Thus an 
observer may associate and correspondingly identify every entity in a state using a unique tag. In 



1 A multiset M on a set E is a mapping associating nonnegative integers (representing multiplicities) with each element 
of E, M : E — > Af. Informally a multiset may contain multiple copies of its elements. 
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cases, where tags are selected such that they remain invariant under time progression of the model 
(i.e., do not change owing to reactions or interactions of the entities), the tags can as well be used for 
recognition of the persistence of these entities across the states of the model. 

For example, in case of CBS, an observer might identify individual strings as entities such that to 
distinguish syntactically identical strings, we can associate with every string an integer tag such that 
with tag i, an entity corresponding to the binary string s can be represented as [s]j. Thus a possible 
set of entities corresponding to the example state given above becomes 

{[0010rj l5 [00101] 2 , [10101] 3 , [010] 4 , [0100] 5 , [10100] 6 } 

Alternately another observer may choose to define entities as a tuples consisting of strings with three 
identical leftmost bits - giving the set of entities for the same state as 

{[00101, 00101]^ [10101, 10100] 2i [Q10,0100] 3 } 

Observer Abstraction 2 (State Function). F C E xE returns the state(s) in the state sequences 
in which a particular entity is observed. For a specific state sequence F can be treated as a function. 

The state information provided by F for entities will be used later to define valid evolutionary re- 
lationships among them. In general observers may use different mechanisms based upon the nature of 
model as well as the entities defined, to determine the state for a given entity. For example, as a simple 
mechanism, in case of CBS, the observer can maintain a table mapping entities to their corresponding 
states in order to define F. 

Having defined the sequence of states with temporal ordering and the entities identified by their 
tags, we will now proceed to discuss how an observer might define the detailed observable character- 
istics for such entities. Using these characteristics it can draw descendent relationship, as well as can 
establish presence of other components of evolution, e.g., heredity and variation. To this aim, we will 
define 'character space', as set of values for the observed characteristics. These values might be purely 
symbolic without any relative ordering or can be ordered using suitable ordering relation. 

Observer Abstraction 3 (Character Space). The observer should define the set of all possible 
orthogonal and measurable characteristics for possible entities in the model as a multi dimensional 
character space T = Chari x Char2 x . . . x Char n , where each of Chari is the set of values for i th 
characteristic. Each of Chari make one dimension in the space T. Each entity e e E is thus a point 
in T , say e = (vi, 1>2, ■ • ■ v n ), where Vi € Chari. 

For a vector x = (ai, a2> • • • , Q-r), i th element (ai) will be denoted as x[i]. For some of the charac- 
teristics observer might define a 'partial ordering' (<j for Chari e T), which can be used to compare 
values for those characteristics. The absence of any characteristics in an entity is represented by special 
zero element Qchari such that if Chari is(partially) ordered then Vw G Chari. c hari <i v - 

Notice that, observable characteristics need not to be limited to syntactic level or structural prop- 
erties and can also include semantic properties - observable patterns of behaviors. Though semantic 
properties are much more difficult to observe and measure than the syntactic ones since they require 
abstracting the patterns of reactions over a range of states. 

In case of CBS, for simplicity we may assume that model consists of binary strings of size n. In 
that case each position of the string can represent one orthogonal dimension and we have only two 
binary values ({0, 1}) at any position in a string for corresponding dimension. Thus character space T 
in CBS is n dimensional binary hypercube with each string occupying a possible diagonal end point. 
We will represent this hypercube as {0, 1}™. The ordering relation < for all dimensions is the same 
and defined as < 1. 

In terms of such character space T, an entity set E at any state can be defined by annotating 
the points in T with integer constants denoting the multiplicity of the entities present in E with 
characteristics defined by the point. 

2.1.3 Distance Measures 

Another important structure in the framework is the "dissimilarity measure" (D) to define the "ob- 
servable differences" (Diff) between the characteristics of the entities in a population. The distance 
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measure defined below can be used by the observer to distribute entities into separate clusters such that 
entities in the same cluster are sufficiently similar while entities from different clusters are distinguish- 
ably different in their characteristics. Again exact definition of distance function is model dependent. 

Observer Abstraction 4 (Distance Measure). An observer defines a decidable clustering distance 
measure D : E x E — ► Diff , where Diff is the set of values to characterize the observable "differences" 
between entities in E. 

Examples include the Hamming distance to define distance between genomic strings in the Eigen's 
model of molecular evolution [SchOlJ, set of points where two computable functions differ in their 
function graphs, or the set of instructions where two programs may differ. One of the known criterions 
to define the concept of species is "phenotype similarity" [Rid96 , which can also be seen as another 
example for distance measure. 

In case of CBS, we can define an auxiliary function © : {0, 1} x {0, 1} — > {0, 1} as a binary XOR 
such that we have 0©0 = 1©1 = 0, and 1©0 = 0©1 = 1. Thus the clustering distance measure 
D : ExE -> {0, 1}" is defined such that Vi.D(ei, e 2 )[i] = e 1 [i]®e 2 [i\, which implies that Diff = {0, 1}™. 
For example in case of two n = 3 bit binary entities e% — [001]i and e 2 = [101]2, D{ei,e 2 ) — 100. 
Other alternatives may include Hamming distance measure D{e\,e2) = S™=i( e i[*] © e 2W) with Diff 

{0. 1 />|. 

2.1.4 Observable Limits on Mutational Changes 

The observer needs to specify the limits under which it can recognize an entity across states even in the 
presence of mutational changes in the entity owing to its interactions with the environment. This is an 
inherent limiting property on the part of the observer and could vary among observers. Based upon the 
limit referred here as 5 mut , an observer can establish whether two entities in different successive states 
are indeed the same with differences owning to mutations or not. The smaller the limit, the harder 
it will be for an observer to keep recognizing entities across states and he would be counting mutated 
entities as the new entities. As entities are observed in more and more refined levels of details, their 
apparent similarities melt away and differences become sharply noticeable. 

Another type of mutations arise during reproduction, in which case an observer has to identify 
whether an entity is indeed an descendent of another entity even though they might not be similar. 
This necessitates us to introduce another bound on observable reproductive mutations as d rep _ mu t- 
This limit on observable reproductive mutations is indeed crucial while working with models where 
epigenetic development in the entities can be observed |MB97j . This is because in such chemistries 
including examples from real life, the "child" entity and the "parent" entities do not resemble with 
each other at the beginning and observer has to wait until whole epigenetic developmental process 
gets unfolded and then compare the entities for similarities in their characteristics. S repmu t assists an 
observer to establish whether a particular entity could be treated as a "descendent" of another entity 
or not. 

Another reason for introducing the limit S repmu t is that from the view point of an high level obser- 
vation process not recording every micro level details, it is quite essential to distinguish between parent 
entities and other secondary entities involved in the reproductive process. Consider, for example, a 
model where entity A reproduces according to reaction A + B — > 2A' + C, where A' is mutant child 
entity of A, which can be determined by an observational process only when it can establish that A and 
A' are sufficiently similar with respect to their characteristics, while A' and B are not. These limits on 
observable differences are formally defined as follows: 

Observer Abstraction 5 (Mutation Bounds). Based upon the choice of clustering distance measure 
D, the observer selects some suitable 5 mu t, S rep mu t S Diff, which will be used later to bound mutational 
changes (both reproductive and otherwise) for proper recognition. S mut and S rep mut are vectors such 
that each element specifies an observer-defined threshold on the recognizable mutational changes for 
corresponding characteristics. 

It is important to note that the choice of 6 mu t, S rep _ mu t critically affects further inferences. For 
example, a choice of very large values would result in the lack of identification of variability in charac- 
teristics and thus make it difficult to infer natural selection (discussed later) . On the other hand if an 
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observer decides to select very small values for S mu t then it cannot recognize persistence of an entity 
across states under changes, similarly small values for 5 rep _ mu t make it harder to establish reproductive 
relationship among entities and for such an observer every new entity would seem to be appearing de 
novo in the model. 

2.2 Evolutionary Components 

Having defined the observation process as a computable transformation from the underlying sequence of 
observed states of the model to the set of components involving entities and their observable character- 
istics with measurable differences as well as observable limits on such differences, we will now proceed 
with formalization of the fundamental evolutionary components: mutations, reproduction, heredity 
and natural selection. 

2.2.1 Mutations 

For evolution to be effective entities should change (mutate) over the course of their interaction with 
the environment (or other entities.) Moreover, there can also be observable differences between the 
child and the parent entities arising out of reproductive processes. These changes in the characteristics 
of the entities may or may not be inheritable based upon the design of the model and the simulation 
instance. 

Mutations can be considered of carrying two kinds of effects in the entities: one where mutations 
change the values for specific characteristics, secondly where after mutation an entity has at least one 
new character not present before or when certain characteristics are lost. We define a Recognition 
relation to establish the non reproductive mutational changes in the entities: 

Definition 1 (Recognition Relation). The observer establishes recognition of entities across states 
of the model with ( or without) mutations by defining the function R,5 mut ■' E ^ E, which is a partial 
function and satisfies the following axioms: 

Axiom 1. Ve,e' e E . Ra mut (e) = e' => F(e') = F{e) + 1. 

Informally, the axiom states that entities to be recognized as the same even with mutational changes 
have be observed in successive states. R5 mut is defined anti symmetric to ensure that entities are rec- 
ognized based upon the time progression of the model not in any other arbitrary order. 

Axiom 2. Rj mut is an infective function, that is, Ve, e' € E. Ra mut (e) = Ra mut (e') =^> e = e' 

Informally, the axiom states that no two different entities in one state can be recognized as the 
same in the next state. 

Axiom 3. Ve,e' e E. VChan e T. R«5 mut (e) = e' => dlffi ^ D(e,e')[i\ ^ 6 mut [i\ 

Informally Ra mut (e) is that e' G E, which is recognized in the next state by the observer as e in 
the previous state with possible mutations bounded by 6 mut . In other words if entity e mutates and 
changes in the next state and identified as e' , then observer might be able to recognize e and e' as the 
same if these changes (between e and e') are bounded by 5 mut - 

2.2.2 Reproduction 

Reproduction is one of the fundamental components of evolution. Through reproduction, entities pass 
on their characteristics to the next generation and increase the population size. Reproduction is pos- 
sibly the only way by which abstract entity structures can persist across generations in case of those 
Alifc models, where entities do not persist forever. In our framework, the way an observer establishes 
reproduction is by providing observed evidence for it. This is done by defining causal descendence 
relationships among the entities across states. The parent and the child entities are recognized by the 
observer as being sufficiently similar and "causally" connected across the states: 
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Figure 1: Graphical view of the relationships between entities in successive states. Recognition relation 
Rec, Causal relation C, and AncestorOf . 

Observer Abstraction 6 (Observed Causality). C C E x E . C establishes the observed causality 
among the entities appearing in the successive states. C satisfies the following axiom: 

Axiom 4 (Causality). Ve,e' GB.(e,e')eC^ [F(e') = F(e) + 1] A $e" G E . F{e") = F(e)AR s t 
(e") = e'] 

Informally, the axiom on causal relationship C states that, if an entity e is causally connected to 
another entity e', then the observer must observe e' in the next state of e and never before. This is 
to ensure that mutations are not confused by the observer with reproductions. Notice that in order to 
establish causal relation between entities, observers need not necessarily know the underlying reaction 
semantics or the micro level dynamics of the model. Only requirement is that the observer's claimed 
causality conforms with the stated axiom. In essence, this formulation of causality is an abstract 
specification which demands observers to identify the entities which have been observed to be causal 
sources for the appearance of a new entity. Only then proper descendance relation for the new entity 
can be established. 

Apart from causality C we also need auxiliary relation A to determine that the differences due to 
the reproductive mutations are also bounded by 6 rep _ mu t. 

Definition 2. A C E x E such that Ve, e' G E . (e, e') G A V 'Chart £ T . if Chari has an ordering 
then D(e,e')[i] dn <W„m«t[*]- 

Informally for (e,e') to be in A, their differences for each single characteristic Chari must be 
bounded by 8 rep _ mut [i] . 

Based on the thus established notion of "causal" relationships between entities and A, we will 
define AncestorOf relation, which connects entities for which an observer can establish descendence 
relationship across generations. 

Definition 3. AncestorOf = ( (C U Ra mut ) + n A) + 

In this definition the (inner) transitive closure of (C U R<5 mut ) captures the observed causality (C) 
across multiple states even in cases when "parent" entities might undergo mutational changes (Ri mut ) 
before "child" entities complete their "epigenetic" maturation with possible reproductive mutations. 
Intersection with A ensures that causally related parent and child entities are not too different from 
each other, that is, reproductive mutational changes are under observable limit. Outer transitive closure 
is to make AncestorOf relationship transitive in nature so that entities in the same lineage can be 
related with each other. For e, e' G E, (e, e') G AncestorOf, describes that e is observed as an ancestor 
of e'. 



Figure [T] depicts graphically the relationships between entities in successive states. Vertical lines 
represent the states (So, Si, S 2 , S3, S4). Various kinds of arrows represent different relationships: recog- 
nition relation R<5 mut , causal relation C, and AncestorOf . The end points of the arrows on state lines 
represent entities. 

Claim 1. Case of Reflexive Autocatalysis. 

Proof. In the simplest form, a reflexive autocatalytic cycle is represented as a system of reaction 
equations: 

A + X 1 = A l +Y l 
Ai + X 2 = A 2 + Y 2 

A n ^ + X n = mA' + Y n 

where m copies of entity A' are produced at the end and that entity A' is a variation of entity A, i.e., 
(A, A') e A. Such autocataltic cycles are supposed to be the chemical basis of biological growth and 
reproduction. Examples include the Calvin cycle, reductive citric acid cycle, and the formose system. 
Competing cycles of this sort can even undergo limited evolution, though they are supposed to have 
very limited heredity [SS97 . 

In the current framework suppose an observer could determine the causal relations - (A,A\), 
(Ax, A 2 ), . . ., (A n -\, A'). Also assume that entity A does not undergo any changes before A' is pro- 
duced, that is, (A, A) e R m ut- Then (C U R<5 mut ) + would contain (A, A') so also would ( (C U 
R<5 mu t) + n A) establishing the reproduction of A through reflexive autocatalytic cycle and with 
variation. □ 

Claim 2. Recognition of reproductive relationships under parental mutations together with reproductive 
mutations and epigenetic developments in the child entities. 

Proof. Let us see what it requires for establishing reproductive relationship when (parent) entities 
might be undergoing changes across states and child entities not only differ from the parent entities 
owing to reproductive mutational changes but also that there exist epigenetic developments in the child 
entities, which make it harder for any observer to establish similarities between child and parent entities 
by observing the child entities only in the beginning (i.e., in the state when child entities were observed 
for the first time.) Naturally it would require that an observer observes child entities so long that their 
epigenetic development unfolds completely - since in general there cannot be any fixed limit on the 
number of states required for such epigenetic development, we capture this requirement of observations 
across states using transitive closure - (C U R<5 mut ) + , where R,5 mut ensures that (mutational) changes 
in the parent entities and also the changes in the child entities during epigenetic development arc 
accounted for. 

Lets us assume that in a state Si, a child entity c was observed for the first time and (parent) entity 
p present in the state Si-i was observed to be casually connected to it. Suppose that for entity e its 
epigenetic development unfolds through states Si+%, Si+2, ■ ■ ■ , Si+ r such that with changes owing to the 
development c was observed as C\,c 2 , ■ ■ ■ ,c r in these states with (c, C\), (c\, c%), . . . , (c r _i, c r ) G R,5 milt ■ 
Similarly suppose that parent entity p undergoes mutations in these successive states and observed as 

Pi,P2 ,Pr such that (p,p 1 ),(pi,p 2 ),. . ■ ,(p r -i,Pr) € R« mut . It is clear that (C U Ra mut ) + would 

contain (p, c), (p, C\), . . . ,(p, c r ), . . . , (p r , c), (p r , c\), . . . , (p r , c r ) among other tuples implying that the 
intersection of (C U R<5 mut ) + with A would result in those tuples (p m ,c n ), where p m and c n are 
sufficiently similar in their characteristic. Therefore if the resultant set ( (C U R,5 mut ) + n A) + 
is not empty, the observer can establish the reproductive relationship between entities p and c even 
under parental mutational changes and the epigenetic changes and reproductive mutations in the child 
entity. □ 

Using AncestorOf relation, we now can consider the cases of entity level reproduction and Fecun- 
dity: 
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Case 1: Entity Level Reproduction 

We consider the case where instances of individual entities can be observed as reproducing even though 
there might not be any observable increase in the size of the whole population. 

For a given simulation of the model, an observer defines the following Parent a relation: 



Definition 4. 

Parent a = {(p>c) G Ancestor Of | 

3e G E . [(p, e) G AncestorOf A (e, c) € AncestorOf]} 

The condition in defining Parent a is used to ensure that p is the immediate parent of c and thus 
there is no intermediate ancestor e between p and c. Using Parent a relation, in order for the observer 
to establish reproduction in the model, the following axiom should be satisfied: 



Axiom 5 (Reproduction). 3state sequence T E T . Parent a ^ 

This means, if there is reproduction in the model, then there should exists some simulation T G T 
of the model, where at least one instance of reproduction is observed. 

In case of CBS, we consider a very simple model of reproduction, where at any state of the model 
some of the strings are randomly chosen and are copied with some random errors. How it is done 
remains hidden from the observer but the observer can observe which parent entities are chosen for 
copying and can establish causal relation between these parent and their copied child entities if the 
random errors occur only at even positions as the way 8 rep _ mu t has been defined in Section 2.1.4 It 



can be easily seen that under such construction scheme Axiom of Reproduction will be satisfied. 



Case 2: Population Level Reproduction - Fecundity 

Though entity level reproduction is essential to be observed, for natural selection it is the population 
level collective reproductive behavior (fecundity), which is significant owing to the carrying capacity 
of the environment. Since carrying capacity is an limiting constraint on the maximum possible size of 
population, an observer needs to establish that there is no perpetual decline in the size of the popu- 
lation. In other terms for all generations, there exists a future generation that is of the same size or 
larger. This allows cyclic population sizes where the cycle mean grows (or stays steady) over time. Also 
in case of fecundity, an observer need not to observe all the parents in the same state, nor do children 
need to be observed in the same states of the model. Formally we require the observer to establish 
Fecundity by satisfying the following axiom: 



Axiom 6 (Fecundity). There exist infinitely many different generations of entities in temporal order- 
ing G U G 2 , ■ ■ . such that (VG, C E)(3G j>i C E) . \GA > \G,\ where Gj = {c € E \ 3a G G t . (a, c) € 
AncestorOf}, (operator \.\ returns the size of a set.) 



Informally, the axiom states that for every generation of entities (Gj), in future there exist generation 
of its descendent entities (Gj) such that the size of descendent generation must be equal or more 
than current generation. Note that the granularity of the time for determining generations is entirely 
dependent on the design of the model and the observation process. 

We can now formulate another important axiom from evolutionary perspective, which asserts that 
reproduction in the model should not entirely cease because of the (harmful) mutations. 



Axiom 7 (Preservation of Reproduction under Mutations). Some mutations do preserve re- 
production. Formally, 3e G E . Ch e = {e' G E : (e, e') G Parent A U R,5 mut } ^ => 3e" € Ch e . {e' G 
E : (e", e') G Parent a} =/= 

Informally, this means, there exists entity e G E, which reproduces (with mutations) and one of 
those (mutant) children of e can also further reproduce. Ch e denotes the set of children of e. 

In case of CBS, since copying mechanisms do not work differently based upon selected entities, 
hence the errors during copying process do preserve the above axiom of Preservation of Reproduction 
under Mutations. 
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2.2.3 Heredity 



Heredity, yet another precondition for evolution, can in general be observed in two different levels: 
Syntactic level and Semantic level. On syntactic level, entity level inheritance is implied by the struc- 
tural proximity between parents and their progenies ranging over several generations - though in case 
of continuous structural changes in the parental entities and epigenctic development in progenies, this 
would require an observer to establish structural similarities over a range of states as discussed earlier 
with the definition of AncestorOf relation. Also for syntactic inheritance to persist, design of the 
model needs to ensure that environment, which controls the reaction semantics of entities, remains 
approximately constant over a course of time so that structural similarities also result into continued 
reproductive behavior. 

Difficulty arises primarily on the level of multi parental reproduction - in this situation an observer 
might have to stipulate some kind of gender types and might have to relax the mechanism of recognizing 
the parent-child relationship in a way as happens for example in case of organic life, where male-female 
reproductive process (often) gives birth to a progeny belonging to "only" one gender type. In such 
a case, for heredity, an observer need to ensure that, over a course of time all the gender types are 
sufficiently produced in the population. 

On the other hand it is also possible to observe inheritance on the semantic level (ignoring structural 
differences) in terms of semantic relatedness between entities, whereby an observer can observe that 
progenies and their parental entities exhibit similarities in their (reproductive) behaviors under near 
identical set of environments. This in turn would require an observer to identify the possible sequences 
of observable reactions between existing entities, which appear to be yielding new set entities (children) 
and in the child generation as well there exist a similar observable reproductive process, which enables 
the (re) production of entities. Such an observation would enable the observer to abstract the repro- 
ductive processes currently operational in the model. The inherent difficulties in this view are obvious 
- in essence an observer needs to abstract the reproductive semantics from observable reactions in the 
model, which in turn might require non trivial inferences in absence of the knowledge of the actual 
design of the model. 

Considering the case of real-life from an observational view point, semantic view is in fact an 
abstraction over all the reproductive processes existing across various species and levels including the 
case of bacterial organisms, where next generation of bacteria may contain a mix of genetic material 
from various parental bacterium of previous generation through the process of horizontal transmission. 
So while in case of syntactic inheritance an observer would only be able establish inheritance across 
organisms belonging to same species, using semantic view, he could expand his horizon to the all organic 
life as a whole. 

However, heredity as a mechanism of preservation of syntactic structures, appears to be crucial for 
those ALifc models where entities have very limited set of reproductive variations possible, that is, where 
environment supports only rare forms of entities to reproduce and any changes in the syntactic structure 
of these reproductive entities may result in the elimination of the reproductive capability. Real-life on 
earth as well as the model of the Langton loops (as discussed further in Section |3.2| are definitive 
examples where most of the variations in the genetic structure, or the loops geometry/transition rules 
result in the loss of reproductive/replicative capabilities. 

Also heredity usually requires further mechanisms to reduce possible undoing of current mutations 
in future generations owing to new mutations. Therefore, in order to establish inheritance in ALife 
models, sufficiently many generations of reproducing entities need to be observed to determine that the 
number of parent-child pairs where certain characteristics (both syntactic and semantic) were inherited 
by child entities without further mutations is significantly larger than those cases where mutations 
altered the characteristics in the child entities. We can express it as the following axiom: 

Axiom 8 (Heredity). Let a statistically large observed subsequence of a run T: 

tt = lim N ^ ca {S n , . . . S N },n < N 

Consider Parent A = {(e, e') € Parent a \F(e) £ £lAF(e') € f2} to be the set of all parent - child pairs 
observed in Q. Again let Inherited^ = {(e, e') 6 Parent A \3Chari 6 T . D(e,e')[i] — Odiff } be the 
set of those cases of reproduction where i th characteristics were inherited without (further) mutation. 
Then high degree of inheritance for i th characteristics Chari implies that | Parent A |/|Inherited^| 
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~ 1. For syntactic inheritance to be observed in a population of entities, we should have some such 
characteristics which satisfy this condition. 

The axiom of heredity together with the axiom of preservation of reproduction under mutation 
ensures that reproductive variation is maintained and propagated across generations. 

2.2.4 Natural Selection 

There are several existing notions of selection in the literature on evolutionary theory [Fut98 , Rid96 
Rid97, SHOO, MB 97 ) lKim83j . In case of our observation based framework we choose to define natural 
selection as a statistical inference of average reproductive success, which should be established by an 
observer on the population of self reproducing entities over an evolutionary time scale i.e., over statis- 
tically large number of states in a state sequence. Other notions of selection using fitness, adaptedness, 
or traits etc. are rather intricate in nature because these concepts are relative to the specific abstraction 
of "common environment" shared by entities and "the environment-entity interactions" , which are the 
most basic processes of selection. Nonetheless selecting appropriate generic abstraction for these from 
the point of view of an observation process is not so simple. Therefore we consider more straight- 
forward approach based upon the idea that on evolutionary scale the relative reproductive success is 
an effective measure, which is also an indicator of better adaptedness or fitness. We thus define the 
following (necessary) axioms for the natural selection: 

Axiom 9 (Observation on Evolutionary Time Scale). An Observer must observe statistically 
significant population of different reproducing entities, say A (\A\ 1), for statistically large num- 
ber of states in a state sequence T G T '. That is, for a statistically large subsequence Q of T, fi = 
limpf^ 00 (S n , . . . Sn),ti <C N, the observer defines the set of reproducing entities A C (Js ef2 SR(Sj), 
where SR(Sj) = {e G E\ [F(e) = Sj] A [Be' G E . (e, e') G Parent^]} is the set of all reproducing 
entities in state Sj G fl. 

Axiom 10 (Sorting). Entities in A should be different with respect to characteristics in T and there 
should exist differential rate of reproduction among these reproducing entities. Rate of reproduction for 
an entity is the number of child entities it reproduces before undergoing any mutations beyond observable 
limit. 

In other words, Rate rep : E — > N + defined as Ve G E . Rate rep (e) = \Child e \ where Child e — {ef G 
E\3e" G E . (e",e') G Parent A and [Rj mut (e) = e"AV(7kr, G T . D(e,e") = djJ? J}. 

The above two axioms though necessary are not sufficient to establish natural selection since these 
cannot be use as such to distinguish between natural selection with neutral selection |SH00| . The 
following axioms are therefore needed to sufficiently establish natural selection. 

Axiom 11 (Heritable Variation). There must be variation in heritable mutations in population of 
A. Formally, let 

Child rnut — {e G A|3e' G A . (e, e') G Parent A A [3Chari G T . dtffi -< D(e,e')[i]}} 

be the set of child entities carrying reproductive mutations. Let Var_Child mu t C Child mut be the set 
of those child entities which carry different mutations with respect to characteristics in T , that is, 

Ve, e G VarJJhildmut we have 3Chari G T . Qdiff -4 D(e, e 

Then axiom of heritable mutation demands that \VarJJhild mut \ 3> 1, that is, there are significantly 
many child entities carrying different mutations. 

Axiom 12 (Correlation). There must be non zero correlation between heritable variation and differ- 
ential rate of reproduction. Formally, 

yChari G T . Ve, e G Var_Child mut ■ the following two conditions should hold: 

i) e[i] <i e'[i] <^[Rate rep {e) < Rate rep {e')] V [Rate rep (e) > Rate rep (e')} 

ii) e[i] =i e'[i] Rate rep (e) = Rate rep (e') 
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Informally, this means as the value of characteristics inherited by the child entity changes, rate 
of reproduction also changes. Based upon the environmental pressures with respect to a particular 
characteristics, rate of reproduction might cither increase or decrease as the characteristic changes. 

The last two axioms state that there must be significant variation in population (in characters) of 
entities which must be maintained for evolutionarily significant periods and that this variation must be 
caused by the differences in inheriting mutations from the parent entities, which in turn directly affect 
the rate of reproduction. 

Having formalized the fundamental component of evolutionary processes to be observed in a model, 
we will illustrate the framework on two important ALife models in the following Section. These 
illustrations will later be used in concluding Section [5] to extract generic design principles for ALife 
research. 

3 Case Studies 

3.1 General Considerations 

Having described the generic formal framework in Section [2j which formalizes the concept of observa- 
tions and consequent axiomatic inferences to establish the level of evolution for ALife studies, in the 
following sections, we will apply the formalism to different models as case studies. These case studies 
include Cellular Automata based Langton Loops |Lan97] and A Calculus based Algorithmic Chemistry 
Fon92]. The case studies elaborate the steps and technical details specific to the example universe of 
the model, which remained implicitly defined in the generalized description of the framework. 

For a given model, the steps to instantiate the framework can be described as follows: The obser- 
vation process works on the simulations of the model which iteratively change the underlying states 
based upon the application of the updation rules of the model. The observation process starts with 
the identification of states of the model (£) during its simulations (i.e., state sequences T)). Usually 
any change in the model (i.e. the changes in the set of basic units) may give rise to a change of the 
observed state. It is important to note that in some cases there might be any changes in the observable 
state of the model even tough there is ongoing underlying activity in the model, that is, when model 
reaches, for example, a fix point. 

For every state in the state sequence, the observation process (or the observer) needs to identify a 
set of well defined entities with suitable tagging for individual identification (E). These entities need 
to be described in terms of their characteristics (T). Next important task is to define the limits on the 
observable mutational changes in individual characteristics of the entities (5 mu t, S rep _ mu t), which will 
in turn define the recognition relation (R<5 mut ) to relate entities persisting across states of the model 
as well to determine whether two entities might be considered related under dcsccndcnt relationship. 

Once the sets of entities in various successive states of the model as well as their characteristics 
are known, important evolutionary relationships need to be established between them. These evolu- 
tionary relationship depend upon the intermediate causal relation (C) between the entities as observed 
under the mechanics of observation process. Using the limits on mutational changes as well as causal 
relationship between entities, we proceed to define the Ancestor (AncestorOf) and the Parent sets 
(Parent a). These sets determine whether there are entities which might be potentially reproducing 
in the model, even with observable changes between parent and child entities (A). 

Next stage of the observation process is to ascertain the level of effectiveness of evolution in the 
model. Using the long term observations on the model for statistically large number of generations, 
one can infer some statistical patterns for degree of heredity and variation. For natural selection 
to be effective, there should exist large number of reproducing entities with significant variation in 
their characteristics such that there exists correlation of this variation in the characteristics with the 
reproductive success of the entities. 

This process at the end establishes the validity of all or some axioms of the framework for the given 
model which provides clues to the degree upto which evolutionary processes might be effective in that 
model universe. The case studies in the following sections will illustrate this process in detail. 

In these case studies, constructs not explicitly defined are assumed to be same as what is defined 
in the framework. 
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3.2 Case Study 1: Langton Loops 



Research on the self reproduction has a long cherished history starting in early fifties [Bur 70, Sip98 
FM04 . After the pioneering work of Alan Turing in early 40s to define the mechanical meaning of 'com- 
putation' as a Turing machine transitions, John von Neumann defined Cellular Automata (CA) [yN66 
to explain the generic logic of self reproduction in mechanical terms. His synchronous cellular automata 
model was a two dimensional grid divided into cells, where each cell would change in parallel its state 
based upon the states of its neighborhood cells, its own state and its transition rule. For such CA 
model, von Neumann defined a virtual configuration space where he demonstrated analytically that 
there exists some universal replicator configuration which could replicate other configurations as well as 
itself. Though universal replicators are not found in nature and such self replicator was extremely large 
in its size, nonetheless the underlying logic of treating states of cells in the grid both as 'data' as well as 
'instruction' was very fundamental contribution of this model and that was exactly was was discovered 
later in case of real life where DNA sequences specify both transcription as well as translation for their 
own replication in a cell. Another strength of von Neumann's formulation was its ability to give rise 
to unlimited variety of self replicators McMOOa McMOOb . Over the years this model was simplified 
and reduced in size considerably [Cod68, 81-105]. 

Finally Langton introduced loop like self replicating structures in Lan84 , which retained the 
'transcription - translation' property of von Neumann's model excluding the capability of universal 
replication and symbolic computation. Langton's original self-replicating structure is a 86-cell loop 
constructed in two-dimensional, 8-state, 5-neighborhood cellular space consisting of a string of core 
cells in state 1, surrounded by sheath cells in state 2. These loops have since then, been extended into 
several interesting directions including evolving Evoloops in |Say98|. 

These cellular automata based ALife models offer the ideal example for our observer (observation 
process) based framework since these replicating loops and their variations evolve only with respect to 
some high level observation process, which can be used to define entities (loops) and their evolution. 
We will illustrate the formal framework by instantiating it on the Cellular automata based Langton 
loop model. Further details on the model itself can be found in the above references. 

Instantiating the Framework 

We consider the case of two dimensional CA lattice based model. An observation is defined on the CA 
model by assuming an underlying coordinate system such that each cell in a two dimensional cellular 
automata (CA) lattice can be associated with unique coordinates (represented as (x, y).) A cell is then 
completely represented as ((x,y),s), where s € [0..7] is the state of the cell. When a cell is in state 0, 
it is also known as a quiescent cell. Let us denote the set of all cells of a CA model as Cell, which is a 
potentially infinite set. 

For a given cell ((x,y),s) € Cell, its coordinates can be accessed as follows: co x (((x,y), s)) = x, 
co y (((x,y),s)) — y, which can be extended to the set of cells: VZ C Cell, co^(Z) = {J ceZ co x (c), 
cOy{ z ) = Ucez c °y( c )- 

Neigh : Cell — > 2 Cel1 gives the coordinate wise non quiescent cells in the surrounding neighborhood 
of a cell. Formally, V(c = ((x, y),s)) £ Cell we have 

Neigh(c) = {((x±l,y),s'),((x,y±l),s') \s'^0} 

The model Structure 

A CA-based model is usually initialized by setting some finite number of selected cells to non-quiescent 
states. At each step, state of every cell of the model is changed as per the state transition rules. 
Therefore we define for an observer state of the Langton's model as the subset of Cell consisting of 
only non quiescent cells. It is clear that for the observer change in a state is observable only if there is a 
change in the set of non quiescent cells. The state of the model for the observer will also be referred to 
as configuration. Thus S denotes the set of all possible different configurations and a state sequence 
in T is a sequence of configurations observed in temporal order by the observer starting from some 
specific configuration. In the following discussion we will consider a fixed sequence given as T G T , 
starting with a specific initial state given in Figure [2] (Time 0). For the fact that there exist a temporal 
(total) ordering of states in T, we can also associate an integer sequence I — [0, 1,2,.. .] with T, which 
works as an indexing for the states. With the above structure of Langton's CA model, the observer 
takes the following decisions. 
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Entities 



Each entity in some state is characterized by two values - the connected set of non quiescent cells and 
the associated pivot. Two cells are connected only if there exists a consecutive sequence of neighboring 
non quiescent cells joining them in the lattice. The (function) pivot gives the coordinates for a cell 
uniquely associated with an entity in CA lattice in a particular state. Formally the set of entities 
(loops) in the model is defined as follows: 

E = {[Z,pivot(Z)] | 3 a configuration S € T . 

[Z C S A Z ,6 0] A [Vc G Z . 3d E Neigh(c) . c' e Z}} 

To define pivot, an observer may choose the coordinates of top left hand corner cell of an entity as the 
pivot for it. Formally 

pivot(Z) = (min{cc<x (Z)} , max{cOy (Z)}) V(e = [Z , pivot (Z)]) € E 

This gives an obvious characterization for a two dimensional character space T = Char\ x Char-i with 
Char 1 being the set of all non quiescent connected set of cells and Chari being the set of corresponding 
pivots. We do not associate additional tags with entities because pivots can be used to uniquely identify 
them in any state of the model. 

State Function 

F : E I is defined using a table which associates with each entity e e E, the index i e I for the 
state in which e is observed. 

Distance Measure 

Distance function D:£x£^{0,l}x{0, 1} is defined such that Ve, e' e E . D(e, e') = [d g , d p ] where 
d g and d p are defined as follows: d g is only if both entities have the same number of cells arranged 
identically or else it returns 1. d p is when the pivots for both the entities are same and 1 otherwise. 

Limits on Observable Mutations 

The observer next selects S mut — [1,0], which means that observer can recognize an entity in future 
states even with mutations (changes in the states, number, or the arrangement of cells comprising 
the entity) provided that the pivot remains the same. Select S rep _ mu t = [0,1] which implies that for 
reproduction observer strictly demands identical geometrical structure of the parent and child entities, 
though may have different pivots - this is essential to capture exact replication of the loops. 

Observing Reproduction and Fecundity 
Recognition relation R<5 mut : E — > E is defined as follows: 

Ve, e' e E, R 5mut (e) = e' & [F(e') = F(e) + 1] A [D(e, e') < S mut ] 

Informally this means two entities in consecutive states are recognized same only if they have the same 
pivots. Which also means observer can recognize entity even with change in the number, state, and 
geometrical arrangement in the cells of an entity across states provided that entity does not shift in 
CA lattice altogether (which would result in the change of the pivot.) 

Lemma 1. Ra mut satisfies Axiom 1, Axiom 2, and Axiom 3. 

Proof. Axiom 1 and Axiom 3 are satisfied by definition. Axiom 2, which states that R ( 5 mut is an 
injective function holds because no two entities in the same state share the same pivot. This is because 
pivot as defined above is connected to all other cells of the entity and all the non quiescent cells which 
are connected in any state are taken together as one entity. Thus two different entities in the same 
state always consist of cells such that cells in one entity are not connected with the cells of second 
entity, and hence always have different pivots. □ 
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Causal relation The relation C between entities in consecutive states is denned as follows: C C 
E x E such that Ve,e'€E where e = [Z e , pivot (Z e )] and e' = [Z e i ,pivot{Z e i)\ we require 

' 1. co+(Z e ) D co+(Z e 

2. co+(Z e ) D co+OZeO 

3. pivot{Z e ) ^ pivot(Z e i) 

4. F(e') = F(e) + 1 

Intuitively what we demand with above definition of causal relation C is that child entity was part of 
the parent entity and at certain stage it "breaks off" from the parent entity as can be seen in Figure [2] 
at time step 127. 



(e,e') e C 



Lemma 2. Causal relation C defined above satisfies the Causality Axiom. 

Proof. Condition F(e') = F(e) + 1 insures that e and e' are not observed in the same state. To 
establish that e' is not the result of mutations in some other entity e" observed in past (i.e., [F(e") = 
F(e)] A [Rec(e") = e']) we note that because of the definition of Rec, e" and e' would otherwise have 
the same pivots, which means pivot of e" will be included in the set of cells in e (since [co+(Z e ) Z> 
co£(Z e i)] A [cOy(Z e ) D cOy(Z e >)]), which is not possible because e and e" being different entities in 
the same state cannot have cells in common including pivot as argued above in the proof of previous 
lemma. □ 




Time 



Time 60 Time 100 




Time 126 Time 127 Time 151 



Figure 2: Self-Reproduction in Langton loops; screen shots from |Bac07j 



Lemma 3. Axiom of Reproduction and the Axiom of Fecundity are satisfied by the entities and 
abstractions on Langton Loops described above. 

Proof. These two axioms can be established by the observer in a specific state sequence as exemplified 
in Figure [2] and Figure [3] by repeatedly applying the recognition relation Rec when entities are changing 
in number and states of cells (retaining the pivots) and applying the causal relation when a parent 
entity splits (e.g. at Time=127). The relation A connects the initial parent entity and the child entity 
at Time=151. 

With respect to Figure [2j an entity is identified at Time=0 with associated pivot. Between time 
steps [1 . . . 126] entity changes in number and states of its cells but the pivot remains the same, hence as 
per the definition of Rec, the observer can recognize the entity in these successive states. At Time=127, 
the (parent) entity is observed to be splitting into two identical copies. One of these is again recognized 
as the original parent entity because of its pivot and the second entity would be claimed to be causally 
related with the parent entity as per the definition of C. To see this, notice that the parent entity at 
Time=126 contains all the cells of the child entity appearing at Time=127, which satisfies the definition 
of C . Between time steps 128 and 151 both parent and child entities undergo changes in the number 
and states of their cells but their pivots remain fixed. Hence they can again be recognized. Finally at 
Time=151 the child entity becomes identical to the original parent entity, therefore the parent entity 
at Time=0 and the child entity at Time=151 are related using A. The transitive closure finally give 
us the final descendence relationship between the parent and the child entity. □ 
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Gen 4 Gen 6 



Figure 3: Fecundity across generation in a population of Self Replicating Langton Loops; screen shots 
from |Bac07] 



Mutations, Inheritance, and Natural Selection 

Primary focus of Langton while defining the CA based replicating loop model was to demonstrate that 
genotype - phcnotype based coding decoding scheme can be captured in CA universe as well |Lan97j . 
And we have seen that this can be observed by the observer as defined above. Nonetheless, Langton 
loops do not exhibit mutations and indeed if we analyze the underlying state transitions defined for the 
cells in the model, it becomes clear that the transition behavior required for the reproduction changes 
immediately if any changes are introduced in an entity and resulting entity is no longer capable of 
reproduction or in other terms, none of the mutations in existing replicating loops preserve reproduction 
and in terms of the current framework Axiom of Preservation of Reproduction under Mutations is 
not valid. Because of the enormity of possible configurations and transition dynamics it is not easy 
to analyze which kind of replicating loops can ever withstand certain mutations and can preserve 
replicating functionality. Heredity of course is worth considering only when entities mutate and continue 
reproduction. Thus with existing Langton loops, an observer cannot observe heredity and subsequent 
natural selection. 

The extension of Langton loops defined by Sayama as Evoloops in |Say98| is one such attempt, 
where not all the loops in the model are of the same type with respect to the number and geomet- 
rical arrangement of cells and final population witnesses (small) variety of different kinds (in size) of 
reproducing loops scattered on the lattice forming colonies. The Evoloops and their evolution can be 
formulated in the framework by suitably modifying the definition of the distance measure D to measure 
the differences between the entities in the number and geometric arrangement of cells and by changing 
limit 5 rep _ mut such that the observer is able to establish descendence relationship even when the parent 
and the child entities (loops) are not identical. Since evoloops of different types replicate at different 
rates, where rate of replication is measured in terms of number of state transitions, we can infer that the 
loops satisfy the axiom of sorting. Indeed in a weak sense with available simulation results it appears 
that evoloops can be observed demonstrating heredity as well as selection. 

Conclusion 

We have seen that we can formally define an observation process on the CA universe which discovers 
the self replication of so called Langton loops during the simulation of model. The specific observer 
presented here follows the intuition that Langton implicitly stated when describing the loops. We 
also noted that mutations, heredity, and selection based axioms are not met in the model where this 
limitation can be attributed to underlying transition rules of the model. Evoloops, which were designed 
as extensions of Langton loops with mutations can be seen to be evolving with variation in the sizes 
and rates of reproduction. 
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3.3 Case Study 2: Algorithmic Chemistry 



Algorithmic Chemistry (AlChemy) was introduced in |Fon92] and further discussed in jFB94al [FWB94 
FB94b, FB96]. The main focus of the AlChemy is to study the principles behind the emergence of 
biological organizations with the approximate abstraction of real chemistry as A calculus with finite re- 
ductions. Starting with a random population of A terms (molecules), using different filtering conditions 
on reactions, authors describe the emergence of different kinds of organizations: Level organization 
consisting of a set of self copying A terms and hypercycles with mutually copying A terms, Level 1 self 
maintaining organizations consisting of A terms such that every term is effectively produced as a result 
of reaction between some other terms in the same organization and lastly Level 2 organization consisting 
of two or more Level 1 sub organizations such that molecules migrate between these self maintaining 
sub-organizations. They also provide detailed algebraic characterization of Level 1 and Level 2 orga- 
nizations without referring to the underlying syntactical structure of the A terms (molecules) or the 
micro dynamics (reduction semantics and filtering conditions) governing the output of reactions. 

Instantiating the Framework 

In view of the proposed observer based framework, characterization of self replicating molecules and 
hypercycles consisting of mutually copying molecules is achieved by defining an observation process, 
which focusses on individual A terms as entities and identifies hypercycles as a set of individually 
replicating A terms in a sequence of reaction steps (reflexive autocatalysis) . 

Since Level 1 and Level 2 organizations emerge only when self copying reactions are filtered out (i.e., 
self reproduction is not allowed) to ensure that Level organizational structures do not become the 
fixed points, these cannot be analyzed under the current framework design because we only consider 
reproduction, mutation, inheritance, and selection based evolution and emergence of organizations. 

The Chemistry Structure 

A chemical soup of AlChemy consisting of A terms as molecules is usually initialized with a population 
of large number of randomly generated A terms. A state of the chemistry could, therefore, be considered 
as the collection of all these A terms (with multiplicity). Since every non elastic reaction results into 
introduction of output A term into the soup and possible removal of some other randomly chosen terms, 
it is natural to consider such succession of states after every reaction step as a state sequence TeT. 

The components of the observation process defined next are based upon the assumption that it is 
possible to observe the inputs terms for a reaction (collision) , resultant output term to be added to the 
soup, and the randomly deleted terms from the soup, without knowing the actual reaction details or 
the reduction semantics. 

Entities 

For a given state of the chemistry, let the observation process identify each A term as a separate entity 
associating an unique integer tag with it. Each such entity is represented as [w,i] where i is the tag 
uniquely associated with A term w. E is the set of all such entities in the chemistry. 

Tagging: Suitable tagging mechanism needs to be defined by the observer to recognize whether two 
A terms in successive states are the same and to distinguish between multiple syntactically identical 
copies of a A term in the soup at any state. We can associate tags of the form {i S i ze ,ii ex ,im U i) 
(isize,iiex,imui € N) with the individual molecules in the following way: for the initial population of 
A terms, they are arranged with respect to their sizes and we assign the size of these terms as the 
first component in their tags (i s iz&) & n d for terms of same size arrange them lexicographically and 
assign in increasing order second component of their tags (ii ex ) such that multiple copies of a term 
have the same first two components of their tags and then assign increasing integers to each of these 
as their third component of the tag {i mu i). Under such tagging scheme a small population of A terms 
{Ax. a;, Xx.x, Xx1.Xx2.x2] defines the state - {[Ace. a;, (3, 1, 1)], [Xx.x, (3, 1, 2)], [Xx1.Xx2.x2, (5, 1, 1)]}. For 
a given tag tg = k) its components are accessed as i = tg[X],j — tg[2], and k = tg[3]. 

Next we discuss the mechanism for updating these tags after reaction and elimination steps. We 
increment by one the third component of the tags for each entity , which was not deleted from the soup 
from previous state and give new unique tag to the new terms added to the soup with respect to their 
position in the list of terms based on their size and lexicographic order such that third component of the 
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Denote (A B) => p C 
by: 



6 



Hypercycle: {A, BJ 



(A A) => p ... => p B 
(A B) => p ... => p B 
(B B) => p ... => p B 
(B A) => p ... => p A 




i 




Figure 4: Example of self replicating elementary hypercycle organization in AlChemy from [FB94aJ. 
A = Xxi.Xx2-X2 and B = Xx.x . (AB) =$-p C represents reaction between A and B by applying A on 
B yielding C under (3 reduction. 

newly added terms is always given value 1. This numbering scheme reliably maintains the recognition 
of terms across states of the chemistry. 

Distance Measure 

Distance function D : E x E — > {0, 1} x {0,1} is defined such that V(e = [w,t g ],e' — [w',t']) £ 
E . D(e, e')[l] = if w and w' are the same with respect to a renaming implying that entity e' is 
the same entity e in the previous state; otherwise D(e, e')[l] = 1. D(e,e')[2] — if t' [3] — t 5 [3] = 1 
indicating that entity e is observed in the next state as entity e', otherwise D(e, e') [2] = 1. The distance 
function D has been defined keeping in mind the use of these distances in defining recognition relation 
later. 

The Limits on Observable Mutations 

Let S mut = [0,0], indicating that syntactically different A terms (under a renaming) are treated as 
different entities. Also let S rep _ mut = [0, 1] indicating that reproductive mutations resulting into syn- 
tactically different term are not observable. This is primarily because under (3 reduction semantics of 
Alchemy, even changes in the syntactical representations result into very different reaction behaviors. 

Observing Self Replicating Hypercycles 

We can observe the self-replicating elementary hypercycles as sets of self-replicating entities. Let 
us define, for that purpose, the recognition relation Ra mut : E — ► E as follows: Ve, e' € E, R<5 mut 
(e) = e' <4> [F(e') = F(e) + 1] A D(e,e') < 8 mu t- Informally this means two entities in consecutive 
states are recognized same only using their tags. 

Lemma 4. Ri mut satisfies Axiom 1, Axiom 2, and Axiom 3. 

Proof. Axiom 1 and Axiom 3 are satisfied by definition. Axiom 2, which states that Ra mut is an 
injective function holds because of the specific construct of tagging mechanism and the definition of 
Distance function D which is such that two entities in successive states are recognized as same only 
when the difference between their third components of tags is 1, and we know that the observer selects 
new tags in such a way that this difference is 1 only when same entity was present in the previous 
state. □ 

Next let us defines A C E x E such that Ve, e' £ E.(e, e') e A D(e, e') < S rep _ mut - In order to 
define causal relation between entities in the AlChemy, we assume that observer has the knowledge of 
the reacting entities and the output term at any state. Therefore if entities e\ and &2 react in some 
state and yield e , the observer defines causal relation C so that (ei,e G ) € C and (e2,e e ) £ C with 
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F{ei) = F{e 2 ) = F{e a ) - 1. 



Lemma 5. Causal relation C defined above satisfies Axiom 4. 

Proof. First condition of Axiom 4 is satisfied by definition since F(e ) = F(e\) + 1 = Ffa) + 1. The 
second condition [/Be' G E.F{e\) = F(e') A Rd mut ( e ') = e o], that is, there does not exist any third 
entity e' in the previous state, which has mutated into e G , again follows from the specific construct 
of tagging as well as the distance function because as per the tagging mechanism explained before e 
being newly added entity in the chemistry will have the 3 rd component of its tag as 1 and all previously 
present entities, including ei,e2, in the chemistry would have their tags in new states updated such 
that their 3 rd components are always greater than 1. □ 

Relations AncestorOf and Parent can be defined same as in the framework. 

Lemma 6. Axiom of Reproduction and the Axiom of Fecundity are satisfied by the entities and 
corresponding abstractions discussed above. 

Proof. These two axioms depend upon the examples of self replicating A terms as well as elementary 
hypercycles. In case of hypercycles, the observer establishes multi-step reproduction using transitive 
closure of causal relation for each of the entities in the hypercycle. A quite well known example of 
self replicating A term is Xx.(x)(x) since (Xx.(x)(x))(Xx.(x)(x)) (Xx.(x)(x))(Xx.(x)(x)). Though 
in case of Alchemy, the level organization consists of self-copiers like Xx.x and hypercycles like 
{Xx1.Xx2.x2, Xx.x} as illustrated in Figure 3. As per the definition of causal relation, entity instances 
of Xxi . Xx2 -X2 and of Xx.x are causally related to past instances of each other and therefore of themselves. 

□ 

Mutations, Inheritance, and Natural Selection 

As emphasized in [FB94a , primary goal of AlChemy is to study alternative pathways in which higher 
level organizations (i.e., hypercycles, self maintaining organizations) can emerge starting with a random 
set of molecules. Therefore it appears that there is no explicit notion of mutations present in the 
chemistry. To see this notice that every new entity in the population is the result of reaction between 
two other entities. Therefore if one particular observer decides that one of the reacting entities is 
mutating into the resulting entity, it is still difficult to decide which of the two reacting entities should 
be considered as mutating into the new one. Even if such a view is adopted, the observer will observe 
that if a self-copying entity at any reaction step mutates into another entity then most often the new 
entity can no longer self-copy. Thus Axiom 7 (Preservation of Reproduction under Mutation) would be 
violated. Finally as discussed at the beginning of the section, owing to the focus of our framework on 
the evolutionary processes, self-maintaining organization of the kind that arise in AlChemy are beyond 
the scope. 

Conclusion 

Thus we have demonstrated that, based upon the knowledge of reacting terms and outputs, a precise 
observation process can be defined to work with AlChmey, which can be used to discover the self 
replicating A terms as well as hypercycles in the model. We also noted that mutations, heredity, and 
selection based axioms are not met in the chemistry where this limitation should be attributed to 
underlying reaction semantics of the chemistry as well as its design. This study highlights the fact 
that not all interesting dynamic processes are evolutionary in nature and therefore some of these non 
evolutionary processes are out of scope of the framework at present. 

4 Related Work 

Because of the presence of sufficiently many biology-specific criterion (e.g., morphological characters, 
bio-molecular structures etc.) to distinguish life from non-life, in biological literature there is little 
formal work on recognizing life per se. There is, however some recent work on defining and developing 
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methods to analyze genotype space structure based upon the macroscopic observations on phenotype 
characteristics (mainly mor phological and reactive characteristics) |JG98I [GCKV031 ILGC04] . 

To the authors' knowledge, there is not much work focussing on the observation process for ALifc 
studies reported in literature. Though there exist proposals to define 'numerical parameters' or 'statis- 
tics' |Bed99j to recognize life in a model. However, it is not clear whether there can be simple numerical 
definitions capturing the essence of life in arbitrary models and even if so does not seem to be the case 
with the existing proposals. The difficulty arises out of intricate nature of reproduction and selection 
inevitably involving non trivial identification of the population of evolving entities. Langton defined 
in [Lan90j a quantitative matric, called lambda parameter to detect life in any generic one dimensional 
cellular automata model based upon the characteristics of its transition rules. This lambda parameter 
based analysis is based upon the assumption that any self organizing system can be treated as living 
and does not consider population centric evolutionary behavior as characteristic of life. In BSP98 
there is a discussion on the classification of long term adaptive evolutionary dynamics in natural and 
artificially evolving systems. This they achieve by defining activity statistics for the components, which 
quantifies the adaptive value of components (characteristics in our model) . They employ similar mech- 
anism as of ours by associating activity counters (tags) with all the components present in the system 
during simulation. 

Self-reproduction, which has a long history of research starting from the late 1950s |Bur70l |Sip98[ 
IFM04] has evaded precise formal definition applicable to a wide range of models |ND98] in the sense of 
observable characterization of the reproducing entities. Though there is enough work on mathematical 
analysis of replication dynamics (fecundity) in various natural systems or the systems where environ- 
mental constraints governing the rate of reproductions are known (see for overview |FM04I Chap5].) 
In some of the discussions related to self-replication in cellular automata models |Say98, ,Mor98 , for- 
malizations of reproducing structures are presented, but they do not attempt to provide a general 
framework for observing reproduction or other components of evolutionary processes. These attempts 
at formalizing reproduction in CA models are reminiscent of our definition of entities (loops) in Sec- 
tion [3U 

In other work [Mis06b], we proposed a multi-set theoretic framework to formalize self reproduc- 
tion (with mutations) in dynamical hierarchies in terms of hierarchal multi-sets and corresponding 
inductively defined meta-reactions. The "self in "self-reproduction" was defined in terms of observed 
structural equivalences between entities. We also introduced constraints to distinguish a simple "col- 
lection" of reacting entities from genuine cases of "emergent" organizational structures consisting of 
semantically coupled multi-set of entities. 



5 Conclusion 

5.1 General Remarks 

This paper formalizes an implicit underlying component of ALife studies, namely the observation pro- 
cess, by which entities are identified and their evolution is observed in a particular ALife simulation. 
Under the assumption that the essence of life-like phenomena is their evolutionary behavior, we de- 
veloped a framework to formally capture basic components of evolutionary phenomena. This work, in 
essence, brings insights from evolutionary theory for real-life into the realm of artificial-life for defin- 
ing a formal framework for observational processes, which are needed for the identification of life-like 
phenomena in the ALife studies. We have argued that without such a formalism, claims pertaining to 
the evolutionary behavior in ALifc studies will remain inconclusive. 

We formally elaborate in algebraic terms the necessary and sufficient steps for an observational 
process, to be employed by an ALife researcher upon the time progressive model of his model universe, 
to uncover (hidden) life-like phenomena in the light of Darwinian evolution as defining characteristics 
of life. The observation process as specified in our framework may be carried out manually or can be 
alternatively algorithmically programmed and integrated within the model. 

To define inference process we specify necessary conditions, as axioms, which must be satisfied by the 
outcomes of observations made upon the model universe in order to infer whether life-like phenomena 



is present in the model (Section 2.2). These axioms also specify the experimental work necessary in 



order to observe and lay claims for the presence of life in the model universe. 



The case studies on Langton loops (Section 3.2) and Algorithmic Chemistry (Section 3.3) highlight 



the contributions that such an approach can make to the discussion of specific ALife experiments. An 
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important property of such a study is to make explicit "multi-level observations" , where entities and 
their relationship can be observed and defined on separate organizational levels. 

The framework design and the case study analysis also provide us clues for ALife research designs 
so that to be better able to witness evolutionary phenomena in the model during its simulations. This 
is discussed next: 



5.2 Design Suggestions for ALife Researchers 

As the framework is based upon the Darwinistic concepts of defining life in terms of evolutionary 
processes, the design suggestions we describe here are rather more suitable for those studies which aim 
to complement real life studies in an evolutionary framework. 

• Sufficient Reproduction with Variation: The model must be designed such that there ex- 
ist potentially large set of reproducing entities with significant variation in their characteristics. 
Quite often this hinges upon the choice of reaction rules or the semantics of the model and indeed 
it is a serious challenge for any model designer to define the reaction semantics which permits 
potentially large set of reproducers with significant variation. Another interesting aspect is that 
these reproducers must be relatively closely related to each other under the reaction semantics. 
This means that sufficiently many variations of reproducers should also be reproducers in them- 
selves otherwise the axiom of preservation of reproduction under mutation will not effectively 
hold in the model and most of the reproducers would have to appear de novo during simulations. 
We encounter this problem in both of the case studies discussed in Section [3] In case of Langton 
loops, any kind of change in the loop structure would cause caseation of replication. The work on 
designing Evoloops is therefore based upon the redefinition of the reaction semantics or transition 
rules which permit variation in replicating loops. Similarly in the case of Algorithmic chemistry, 
almost all of the single replicating A terms arise de novo and their variations do not replicate 
under (3 reaction semantics. 

• Measurable Rates of Reproduction: The model should be designed such that it is possible 
to impose some valid measure of determining the rates of reactions which in turn can be used 
to estimate differences in the rates of reproduction of different entities. This measurement of 
reproductive rates must be independent of the updation algorithm which selects entities for 
reaction. Therefore it can be argued that the models, where all (reproductive) reactions take 
place in a single step would be difficult to observe for natural selection, which works only when 
different entities reproduce at different rates. For example, it is not possible to infer differences in 
the rates of reproduction among different reproducing elementary hypercycles in the Algorithmic 
Chemistry consisting of the same number of A terms because every reaction between any two A 
terms occurs in a single step. On the other hand natural selection can be observed in case of 
Evoloops precisely because different types of loops consisting of different number of cells reproduce 
at different rates based upon the number of state transitions. 



5.3 Limitations 

The decision to equate life with evolutionary processes also excludes some of the interesting complex 
phenomena that are not evolutionary in nature from the scope of this work. Indeed, we have shown 



in Section 3.3 that the framework cannot account for the dynamic non-evolutionary behavior of Level 
1 and Level 2 organizations emerging in the Algorithmic Chemistry. We limit our attention to only 
those observations having evolutionary significance, though other observations can also be made upon 
the model including metabolism [BFF92 , emergence of complexity [AOC00J , self organization Kau93J , 
and autonomous and autopoitic nature of life |Zel81| etc. 

We have not placed direct emphasis on certain concepts widely associated with ALife studies in- 
cluding the notion of "emergence" . In our current setting the notion of "strong emergence" is only 
implicitly present and indeed "the element of surprise" BE97 often associated with emergence is not 
immediate in the framework. Similarly "the element of autonomy" of emergent processes with respect 
to the underlying micro-level dynamics is not addressed in our framework. Indeed, the spirit of the high 
level of observations and corresponding abstractions upon which the framework rests, may preclude 
such inferences. Nonetheless the idea of "weak emergence" |Bed97j . which lays emphasis on the simula- 
tions of the model for the emergence of high level macro-states is fundamental to our framework, where 
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the observation process is by default based upon the simulations of the model and not on analytical 
derivations. 

Another limitation of the framework in its current state is that it cannot be used effectively to 
make predictions regarding the possible observable evolutionary dynamics in a ALife model during 
simulations. This limitation though carries forward from the nature of Darwinian theory which is too 
generic in its conceptualization and based upon random sources of change that make it difficult to 
derive useful predictions. 

Similarly analysis of Godelian type conjunctures to counter possibility of strong Alife, stating the 
impossibility of formalizing life in general because that would imply formalizing "mathematically intel- 
ligent" entities like ourselves, which could in tern prove the Godel theorems in their own "mathematical 
universe" having correspondence with ours, is also beyond the scope of the current limits of the frame- 
work. See [IIl97llRas92j . 



Problem of False Positives 

Terms 'false positive' and 'false negative' are used in general to highlight the limitations of 'observation 
- inference' based methodologies. False positive refers to a situation where observations and consequent 
inferences on a model result into a claim of the presence of certain property in the model which actually 
does not exist, while false negative is used to refer the situation where observations do not yield required 
support for the presence of certain property, which is actually present in the model. False negatives 
are usually the result of incomplete observations while false positives indicate arbitrariness in the 
observation/inference process. 

Like any other generic specification framework, current framework also suffers from the weakness of 
administering false positives. False negatives are also possible, whereby an observation process is defined 
such that it does not infer evolution, even though there might actually be evolution present in the model. 
The case of false negatives, however will not concern us since our focus is to establish the presence of 
evolution in a given ALife model and not whether it is absent with respect to certain observations. 
The problem of false positives stems due to the fact that the framework permits arbitrariness in the 
definition of entities and their causal relationships. In case of causal relationships, they are defined in 
the framework as observation dependent and might not be consistent with the underlying micro-level 
dynamics of the model (Section 2.2.2| . This arbitrariness might give rise to false claims on the presence 



of evolution in the model though there might be none actually. 

For example, an observer (say ob) might decide to "ignore" entities in some states in the beginning 
and then choose later on to observe them in some other states so that to use them for establishing (false) 
evolutionary relationships, which would not have been possible had he not preferred to ignore them 
earlier. This problem of selectively observing entities in various states requires additional constraints 
in the framework. We may add the following constraint by considering another observer ob' with same 
universe of observation as ob. Let us consider a particular simulation of a model as a state sequence 
T. For a state subsequence S of T, let Ef b and Ef b , denote the set of entities observed by ob and ob' 
respectively. Consider that ob' observes some entities X C E^ b ,, which were ignored by ob, that is, 
X % E^ b . Now consider the case when ob chooses to observe X in some later subsequence S' of T, 
5^5", that is, X C E^ b , and also X C E^ b ,, where E^ b , and E^ b , are the sets of entities observed by 
ob and ob' in S' . Now if ob establishes evolutionary relationships using entities in X, which cannot be 
established by ob', then we say that ob has drawn illegitimate conclusions. 



5.4 Further work 

Framework can be further extended in several interesting directions, including the following: We need 
to capture the essence of strong emergence by considering several observation processes at different 
organizational levels of the model. We can also study overlapping evolutionary processes - examples 
from real life include co-evolution, and sexual selection versus environmental selection. Framework 
ought to be extended so that fruitful predictions for a given ALife model regarding the nature of 
evolutionary dynamics can be made. We also need to introduce more strict constraints to overcome 
the problem of false positives by limiting as to what could be claimed as observed. Further insights can 
be gained by applying the framework to novel classes of ALife models to refine the framework further, 
which we are currently involved with. 
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